IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 37, NO. 5, MAY 1989 



535 



mobile telephone system with directed retry," IEEE Trans. Com- 
mun. t vol. COM-34, Apr. 1986. 
[6] G. L. Choudhury and S. S. Rappaport, "Cellular communication 
schemes using generalized fixed channel assignment and collision type 
request channels," IEEE Trans. Veh. Technoi, vol. VT-31, May 
1982. 

[7] J. Karlsson, "A supplementary to a cellular mobile telephone system 
with load sharing," Dep. Commun. Syst. Lund Inst. Technol. Tech. 
Rep., 1986. 

[81 B. Wallstrom, "Congestion studies in telephone systems with overflow 
facilities," Ericsson Technics, no. 3, 1966, Annex 4. 



Efficient Computation of Most Probable States for 
Communication Networks with 
Multimode Components 

CHE-LIANG YANG AND PETER KUBAT 



Abstract—A general multimode model for jointly assessing the per- 
formance and reliability (perf ormability) of a communication network Is 
considered. The perf ormability is computed by averaging the perform- 
ance over the most probable network states. An efficient algorithm, 
superior to all previous algorithms in both speed and flexibility, is 
proposed to enumerate the most probable network states. The algorithm 
keeps generating the network states one by one, in order of decreasing 
probability, until some stopping rule (if specified) is met. 

I. Introduction 

In recent years, efforts [l]-[3] have been devoted to jointly 
assessing the performance and reliability of a communication/ 
computer network. In this approach, all the possible states of 
the network are enumerated, steady-state probability and 
network performance calculated for each state, and the overall 
performance (performability) calculated as the average net- 
work performance in the steady state. Due to the large 
(exponential) size of the state space, in practice, such 
assessment is limited to very small networks. 

To avoid the pitfall of the exponential size of the state space, 
Li and Silvester [4] proposed an approximation — to calculate 
the global performance over the most probable states 
of the network only. Since the network operates in these states 
most of the time, this suggests a good approximation of the 
network performance. In networks where component reliabil- 
ity is high, only a small number of states dominate the state 
space in terms of probability, and thus the calculation of the 
global performance is fast. The efficiency of the algorithm for 
generating most probable states [4] was improved by Lam and 
Li [6]. 

In a recent article, Chiou and Li [5] generalized the 
approach and introduced an algorithm for approximately 
calculating the reliability of communication networks with 
multimode components. Here the performance degradation of 
individual components is characterized by different modes. 
Chiou and Li's algorithm, named ORDER-Af, enumerates m 
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network states Sk* k = 1,2, • * •, m f in order of decreasing 
probability and then averages the network performance over 
these m most probable states. For the network with n 
components and N modes for each component, the complexity 
of their algorithm is 0(n 2 Nm + nNm log m). The algorithm 
requires the selected value of m to be predetermined. 

In order to achieve a prescribed so-called (I — e) coverage 
of the entire sample space ft, Chiou and Li would have to first 
guess the value of m, then run the algorithm to generate m 
most probable states, and finally check to see if m will satisfy 
the coverage requirement that X% ml P(s k ) S: 1 - e. If the 
requirement is not satisfied, the larger value of m is selected, 
and the algorithm will be again executed from the start and so 
on. 

In this paper, we present a new efficient algorithm that can 
enumerate the most probable states s k of the network with 
multimode components, in order of decreasing probability. 
The algorithm keeps generating states, one by one, in every 
0(nN) operations, until any suitable stopping criterion (if 
specified) is met. To identify O t , the minimal set of most 
probable states that satisfies (1 — «) coverage requirement, we 
can run the algorithm until the probability sum of the states 
generated so far is no less than (1 - e). It is shown that the 
complexity is 0(n-N- 10 ( |). Our algorithm does not need to 
guess the size of fl« in advance, and it is at Jeast n times faster 
than the one developed by Chiou and Li [5]. 

II. The Model— Assumptions and Notation 

Consider a communication network which is composed of n 
components (or, units) numbered by 1 , 2, • • • , n. For each / 
= 1,2, •",/!, 

1) unit / operates in one of Ni > 1 modes, called M iM Af f>2 , 
* * *, Mi tNi \ 

2) unit / operates in mode Mjj with probability pjj > 0, 
2^, Pi j = 1, independent of other units. 

Without loss of generality, we will assume that for each / = 
1* 2, • •, n, 

p iA >Pij, y=2, (i) 

Let the n vector x ~ (x it x 2 , * • * , x„) represent a possible state 
of the network units such that Xi = j\ \ ^ j ^ Nj if unit / is in 
mode Mi j. Clearly, there are n? a , N- t possible states for the 
network. Let P{x) denote the probability that the network is in 
state jc. Thus, P(x) = IT^ , p Kxr Define the minimal (1 - e) 
covered space Q ( as 

fl e = U|*=l. 2, m, J />(j,)<l-c<2 P(s k )} 

where the network states s kl k - 1, 2, n?_, /V, are 
assumed to be indexed so that P(s{) 2: P(s 2 ) > P(S}) £ ■ ■ • . 

To describe the algorithm which finds s,- one by one, in 
order of decreasing probability, we need the following 
definitions (see [7] for details). 

A (directed) tree is a directed graph such that there exists 
exactly one vertex, called the root, which no edges enter and 
there is a unique directed path from the root to every vertex. 

Let G = (K, £") be a tree where V and E represent the 
vertex set and edge set, respectively. If the directed edge (w, v) 
is in E, then « is called the father of i\ and u is a son of w. If 
there is a directed path from w to u, then u is an ancestor of v, 
and i>is a descendant of u. A vertex with no descendants is 
called a ieaf; otherwise, this vertex is called an internal 
vertex. A vertex u and all its descendants, denoted by T Uy is 
called a subtree of G. The level of a vertex u is the length 
(number of edges) of the path from the root to this vertex. 
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Clearly, the root of G is the unique vertex that is at level 0. 
The height of a tree is the length of the longest path from the 
root to a leaf. 

An ordered tree is a tree in which the sons of each vertex 
are ordered. When drawing an ordered tree, we assume that 
the sons of each vertex are ordered from left to right. T v is 
called the fth subtree of u if v is the ith son of w. 

III. An Algorithm for Identifying 0 ( 

First, we transform the problem of identifying Q f to a tree 
search problem. Given TV,- and p itJ for / = 1 , * • • , n and j — 1 , 
• • • , define an ordered tree G with height n so that each 
vertex at level /, 0 :£ / S n - 1 , has exactly 1 sons and 
each vertex at level n is a leaf. Clearly, this tree has U* iwX N { 
leaves. Define the address of a vertex u at level /, 1 £ / £ n, 
to be an / vector ( y t , y u • • • , yi) where y t = j if for the unique 
path, say a 0 {root), a u ' ' ' » 1. <* /(«). from root to w, a, is 
the yth son of x . The weight of vertex w, denoted as o>(u), is 
defined to be U'. mt p ityi . Note that there is one-to-one 
correspondence between the address of a leaf in G and a state 
in fi. Thus, the problem of finding Q ( is equivalent to 
identifying the addresses of the heaviest leaves in G such that 
their weight sum is greater than or equal to 1 - e. 

Example I: Consider a 3-unit system whose table of N { and 
Pij is shown in Table I. The corresponding ordered tree G is 
depicted in Fig. 1 where associated with each leaf is the 
address and weight. Observe that there are 12 leaves in G and 
each leaf corresponds to exactly one state of the system. □ 

To solve this problem, our basic strategy is quite simple- 
first identify the address of the heaviest leaf in G, then identify 
that of the second heaviest leaf, and so on until the weight sum 
of all leaves identified so far is greater than or equal to 1 — e. 
In other words, the address of the mth heaviest leaf of G will 
be identified at the /nth iteration of the algorithm. Note that 
since G is fully defined by the table of TV*/ and p iih we can 
conceive the structure of the whole G without having to create 
real tree structures to represent it. 

Lemma 1: For a vertex u of G at level /, the heaviest leaf in 
T u has weight 

/i 

• n 

<-/+ 1 

Proof: This follows directly from (1) and the definition 
of G. □ 
The basic operation of the algorithm is described as follows 
by induction. Suppose that we are at the beginning of the /nth 
iteration of the algorithm and that we have built a subgraph G ' 
of G with the vertex set corresponding to those leaves 
identified (in the first m - 1 iterations) and their ancestors. 
Furthermore, suppose that each internal vertex « (at level /) of 
G' is associated with an array W w [l], ***, WuWv"/+i] such 
that W u [i\ indicates the weight of the heaviest remaining (that 
is, not yet identified) leaf in the ith subtree of u in G. Let 
WirM = 0 if all the leaves in the ith subtree of u in G have 
been identified. With this, we can now identify the heaviest 
remaining leaf (in this case, the /nth heaviest leaf) of G by 
proceeding from the root of G' and determining to visit the 
Jtth son of the currently being visited vertex « at level / < n if 
W u [k\ = maxfWJIJ, W u [N Ui ]). If the kih son of u 
has not yet been created in G ' , then before visiting it, we have 
to first create a vertex u* as the k\h son of w, establish a link 
from u to w', and associate u' with a newly created array 
( W u , [1], • • • , W ut [N t +i]) where at this point, we can simply 
let W u ,[i] = «(M')'A + i./-n;_ /+ , P Jt i because of Lemma 1 
and the fact that no leaf in T u , of G has been identified. When 
finally a leaf (that is, a vertex at level /i), say i/, is reached, we 
can be sure that v corresponds to the heaviest remaining leaf of 
G. Suppose that a 0 (root) t a l9 • • a„_i, a n (v) is the unique 



TABLE 1 

THE DATA FOR THE SYSTEM IN EXAMPLE 1 



unit i 


Ni 


Pt.i 




P»,3 


1 


2 


0.7 


0.3 




2 


3 


0.5 


0.2 


0.3 


3 


2 


0.8 


0.2 






(1.1,1) (1,1,2) (1,2.1) (1,2,2) (1.3,1) (1.3.2) (2.1,1) (2.1,2) (2.2.1) (2.2,2) (2,3,1) (2,3.2) 
0.260 0.070 0.112 0.028 0.160 0.042 0.120 0.030 0.046 0.012 0.072 0016 

Fig. 1. The ordered tree G for Example 1 . 



path from the root to v and that ( ^i, y 2 , • * * , y a ) is the address 
of v. (The address of v can be determined as the algorithm 
traverses from the root to u). Note that after the leaf v has been 
identified, it is not necessary to update W B {2] f • * * for 

every internal vertex u in G' because only W fl( .[ j y J+ i], / = 0, 
•*- t /i - 1, can be affected. (To see this, observe that the 
content of each of a vertex should remain unchanged if v 
is not in the ith subtree of this vertex.) We can easily update 
wy^+il, i - 0, • • •, n - 1 by setting W mm _ x \y H \ = 0 and 
wy-K+il - max(^ ai>1 [lj, WVhW+j]), ' = n " 2 > 
• * * , 0, when backtracking, vertex by vertex, from v to the 
root of C. As soon as the backtracking is completed, the 
algorithm is ready for the (m + l)th iteration because the 
induction hypothesis of G' for the next iteration is satisfied 
now. 

At the beginning of the first iteration where no leaf of G has 
yet been identified and, therefore, G' is empty, we can first 
create a vertex as the root of G ' , associate it with a new array 
WW*!]) such that W^\i\ = p,,rn; Blftlf 
and then follow the general instructions as described above. 

Example 2: Consider again the 3-unit system in Example 1 
where the corresponding ordered tree C was depicted in Fig. 
1 . To identify 0 £ -o.35» the algorithm grows a subgraph G ' of G 
step by step. (Note that the algorithm does not create G. We 
show it here only for illustration.) In Figs. 2(a)-3(a), we 
show, respectively, the G' at iterations 1-2 immediately 
before backtracking. We also show in Figs. 2(b)-3(b), 
respectively, the G' at iterations 1-2 after backtracking. The 
unique path traversed from the root to a leaf at each iteration is 
highlighted by double-circled nodes. Only 4 iterations are 
required in this case and the algorithm generates (in order) 
outputs a.ojs = {U. 1. DO, 3, 1)(2, 1, 1)(1, 2, 1)}. □ 

A complete description of the algorithm is given next. For a 
currently being visited vertex u of G' at level / with weight w 
and address (y u yi, * ■ * , yi) where it is known that T u in G 
has a leaf (referred to as v when found) which is the heaviest 
among all the remaining leaves of G, we now show a recursive 
procedure SEARCH(w, /, id, NEW). This procedure can grow 
G' to build the unique path, say fl/(w), aj+ lt * • * , a n {v), from u 
to u, identify the address (y\, y 2y * * *, y„) of v, and then 
update }V Ci [y t +i], i = n — 1, *•*,/. It finally returns a value 
NEW which is the weight of the heaviest remaining leaf 
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W|l)*,ai/ \W[2]..0T 




w(]|.o/ \w(2]..or 



Fig. 2, (a) C at iteration 1 after backtracking, (b) C at iteration 1 after 
backtracking. 




Fig. 3. (a) C at iteration 2 after backtracking, (b) C at iteration 2 after 
backtracking. 

among T u of G after v has been found. There are two possible 
cases. 

Case I: I = n (i.e., u is a leaf), The procedure SEARCH(«, 
/, w, NEW) will simply print the contents of ( y l9 y Zt * • • , y„) 
and the return with NEW = 0. 

Case II: I < n (i.e., u is an internal vertex). If u is visited 
for the first time, procedure SEARCH(w, /, w, NEW) will first 



create and initialize W u [i] = u)'p t +i t rTIj mt+2 Pj t i for each / = 
1, • • • , Ni+i. Choose k such that wj*] = max ( W'ji], 
W u [N t + If the *th son of u has not yet been created in G ' , 
create vertex u ' as the kth son of w, and establish a link from u 
to u' . The procedure then proceeds to visit the Jtth son of w, 
let> /+l - k t recursively call SEARCH(«' f / + 1, 
NEW') to build the path a, + I (w')> a f+2t • • • , a n (v) from u' to 
u t identify the address ( y lt y 2t • • • , of u, update W^ # .[^/ + ] ], 
i = n - !,•*•,/+ 1, and finally return a value NEW ' 
which indicates the weight of the heaviest remaining leaf 
among the kth subtree of u in G after u has been identified. As 
soon as SEARCH(w ' » / + 1 , w •/?/+ NEW') is returned, the 
procedure will update JfJ/c] with NEW' and return with 
NEW = max{fV M Uh W u [N l+l ]). 

With the procedure SEARCH(w, /, <*>, NEW), the main 
routine for printing IJ ( only needs to create a vertex as the root, 
then repreatedly call SEARCH( root, 0, 1, NEW) until the 
weight sum of all the leaves identified so far becomes greater 
than or equal to 1 - e. A formal description of the procedure 
SEARCH and main routing is as follows: 

Procedure SEARCH(«, /, w, NEW) 
Step I) if / = n then 
begin 

print (yi,y 2 , *••,%); 
SUM - SUM +■ w; 
NEW - 0; 
return 

end; 

Step 2) if u has not been visited before then 

create and let W u [i] - wpt+i 1] for 

/ = 1, yv /+ i; 
Step 3) choose k such that W u [k] = max ( W u [\] t * • ■ , 

Step 4) if the kth son of u has not yet been created then 
begin 

create a vertex w' as the kth son of w; 
establish a link from w to u ' 
end; 

Step 5) y, + l - k; 

visit w' (that is, the kth son of u) via the link 
established before; 

call SEARCH(u\ w*p /+t *, NEW'); 

Step 6) W u {k] «- NEW'; 

NEW +- max ( WJA/^,)); 
Step 7) return. 

Main Routine 

Input: e and p f j for / = 1,2, ••■,/! and 

j = 1,2, . .,7V,; 
Output: 0 ( ; 

Step 0) create an n vector ( y u y Xt •••,%); 
SUM - 0; 
Pdn] - 1; 

for I — n — i to 0doP,[/] «- p^xyPAl + 1]; 
create a vertex root; 
Step I) repeat 

visit root; 

call SEARCH(roof, 0, 1, NEW) 
until SUM 2= 1 - c; 

Step 2) stop. 

Note that the value of variable SUM in the algorithm 
corresponds to the accumulated probability of the most 
probable states enumerated so far. Observe that the complexity 
is dominated by Step I) in the main routine. At each iteration 
of Step I)y n + 1 vertices of G f at different levels 
(corresponding to the vertices on the unique path from the root 
to the heaviest remaining leaf in G) are visited and/or created 
where the algorithm spends 0(7V/ +I ) operations on the vertex 
at level I < n and spends 0(n) operations on the vertex at 
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level n for printing an w-vector. Since Step I) has |Q e | 
iterations, the time complexity of the algorithm is 

o(|Q,l" + |0,l -2"') 

= o(|0,| • ^N^£0(\Q,\nN) 

where N - max (N lf N 2 , * * * , N n ). 
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Effect of Ordering the Codebook on the Efficiency of 
the Partial Distance Search Algorithm for 
Vector Quantization 

K. K. PAUWAL AND V. RAMASUBRAMANIAN 



Abstract— Recently, Bel and Gray have used a partial distance search 
algorithm which reduces the computational complexity of the minimum 
distortion encoding for vector quantization. In the present correspon- 
dence, the effect of ordering the codevectors on the computational 
complexity of the algorithm is studied. It b shown that the computational 
complexity of this algorithm can be reduced further by ordering the 
codevectors according to the sizes of their corresponding dusters. 

I. Introduction 

Vector quantization has become recently very popular in a 
number of areas such as speech coding, image coding and 
speech recognition [1], [2], However, the utilization of vector 
quantization is severely limited by the computational complex- 
ity of the encoding process. The minimum distortion encoding 
of a test vector, using the conventional full -search algorithm 
for a codebook of size N ( = 2 KR , K being vector dimension 
and R the bit rate) requires N vector-distance computations. 
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For the squared -error distance measure, the number of 
multiplications required per sample for the encoding is N, i.e. , 
the size of the codebook, which increases exponentially with 
dimension K and bit rate R. The encoding process, therefore, 
becomes prohibitively expensive for large-size codebooks. In 
order to reduce the computational complexity of the minimum 
distortion search in a vector quantization encoding process, the 
partial distance search method has been recently proposed [3] , 
[4]. In this method, a codevector can be rejected on the basis 
of partial distance, i.e., without completing the total distance 
computation. 

In the present paper, our aim is to study the effect of 
ordering the codevectors on the savings offered by the partial 
distance search. To maximize this savings, we propose an 
ordering of codevectors according to the sizes of their 
corresponding clusters. In addition, we note that the code- 
books obtained at the end of the training process using the 
clustering algorithms such as the Linde-Buzo-Oray algorithm 
[51 have arbitrary orderings and are never guaranteed to be 
arranged in the favorable order. We demonstrate this and 
show that an explicit ordering of the codevectors as proposed 
here is advantageous for deriving more computational savings 
from the partial distance method. 

n. Algorithm 

Let C = {c it i = 1, • • ■ , N} be a codebook of size N 
where C/ « (c#, j = 1 , • • • , K ) is a K dimensional vector. For 
a given test vector X - (x jt j = 1 , • * * , K ), it is required to 
find the minimum distortion codevector q(X) from the 
codebook C under the squared-error distortion measure 
defined as 

d{X, C / )=S(JC;-C tf ) J . 

The basic structure of the partial distance search algorithm 
[4] is as follows: 

rfmin - 00 (a very large number). 

Loop A: For i - 1, ■ • *, AT 
d = 0 

Loop B: For/ = 1, A" 

d « d - C</) Z 

if d > daaa -» next / (exit condition) 
next j 

dmin = d 

min = / 

next i. 

It can be observed that the partial-distance search algorithm 
gains computation saving over the full search algorithm 
because of the provision for a premature exit from Loop B on 
satisfying the condition d > d min (called the exit condition 
henceforth) before the completion of the distance computation 
d{X, a). 

For a given test vector X, the maximum saving is obtained 
when the exit condition is satisfied as early as possible inside 
Loop B for each c t . This in turn is facilitated by the condition 
that the minimum possible value of d^ a for a given X is 
obtained earliest in Loop A. Since data has the minimum 
possible value for a given X only after the actual nearest 
codevector of X has been checked, it is possible to obtain the 
maximum saving in computation by having the first codevec- 
tor as the nearest codevector of X, i.e., Cj «= q{X). This is 
obviously not . practical since prior knowledge of q(X) 
obviates the need for search. However, a probabilistic solution 
can be given to improve the computation saving. 
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INTRODUCTION 

The problem of detecting one out of Af , possibly fad- 
ing, signals immersed in additive noise can be posed as 
an M-axy hypothesis testing problem: the received sig- 
nal, after baseband demodulation, is foi warded to the 
decision circuits, which are eventually to decide which 
one, between the M alternative hypotheses, is actually 
in force. In other words, if the t-th alphabet symbol is 
transmitted, namely under hypothesis if;, then the base- 
band equivalent of the received signal is 

r(t) = i4«p(j0)pi(t)+n(t) *= 1.....M (1) 

wherein pi( ) is the complex envelope of the i-th wave- 
form, A txp(j9) is a complex gain accounting for the pos- 
sible channel fading, and n(t) is the complex envelope of 
the noise, which is typically modelled as a complex, pos- 
sibly correlated stochastic process. For equally likely, 
equi-energy signals, the detection scheme (see figure 1) 
consists of a bank of M filters, each matched to one of 
the admissible waveforms, followed by an envelope detec- 
tor: the maximum of these envelopes is henceforth used 
for making the final decision [1]. 

The rationale for adopting such a scheme is two-fold: 
on one hand, in fact, it turns out to achieve minimum 
erroi probability - uniformly with respect to the prob- 
ability density function (pdf) of A - for the case that 
the noise can be modelled as a white, complex Gaussian 
process and the signal phase 0 is uniformly distributed 
in (-x,r). On the other, for orthogonal signal set with 
non-fluctuating amplitude, this detector is also asymp- 
totically optimum for increasingly high signal set sire 
M, in the sense that arbitrarily reliable communication 
can be achieved, provided the signal- to- noise per bit ex- 
ceeds a threshold level, often referred to as Shannon limit 
[1]. Unfortunately, in many situations of practical inter- 
est the Gaussian assumption for the noise is no longer 
valid. Relevant deviations from the Gaussian law, espe- 
cially in the high-amplitude tail of the measured noise 
distribution, are observed both for man-made noise [2,3] 
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Figure 1: Scheme of the envelope detector 

and for atmospheric noise [3,4,5]: in other words, such 
a disturbance turns out to be impuhive, in the sense 
that noise spikes occur with higher probability than pre- 
dicted by the Gaussian pdf. The classical envelope de- 
tector is no longer optimum in this case in either sense 
specified above: on one hand, in fact, it is not ensured 
to achieve minimum error probability for finite M , on 
the other its performance may not attain the Shannon 
limit for M — ► oo. As to the former point, recent studies 
[6,7,8] demonstrated that, if the impulsive noise is mod- 
elled as a compound-Gaussian process, then the classical 
envelope detector is still optimum, for any set of equally 
likely, equi-energy signals and for any amplitude fluctu- 
ation law: in other words, such a detector is canonical 
with respect to both the noise marginal distribution - 
provided it belongs to the class of compound-Gaussian 
pdf 's - and the pdf of A. 

This paper is aimed at investigating the latter point, 
namely at studying the impact of the noise distribution 
on the limit performance: in fact, even though the sit- 
uation M — ♦ oo is a merely ideal one, since it assumes 
communication with no finite- bandwidth constraint, it 
is of relevant theoretical interest to assess whether in- 
creased signal space dimensionality - and, hence, larger 
transmission bandwidth - can be still traded at will for 
enhanced communication reliability. 
More precisely, in next section, after shortly reviewing 
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the noise model, we summarise some relevant results 
about detection in compound- Gaussian noise; in the last 
section we outline analytical procedures for evaluating 
the eiror probability for Af — ary orthogonal signalling; 
in particular, we give a general relationship, holding true 
for any noise pdf as well as signal amplitude fluctua- 
tion law, showing that even in non-fading channels noise 
spiky n ess dramatically affects the limit performance. 

DETECTION IN IMPULSIVE NOISE 
The optimum - in the usual sense of achieving minimum 
error probability • decision rule for discriminating be- 
tween M equally likely hypotheses is well known to be 
the Maximum Likelihood (ML) rule; therefore, the design 
of the optimum detector requires the specification of the 
received signal under the M alternative hypotheses - or, 
equivalently, of the noise process - in order to calculate 
the relevant likelihood functional [lj. Consequently, the 
need arises for a statistical model of the noise baseband 
equivalent h(i): generally speaking, such a model should, 
on one hand, be mathematically tractable, on the other, 
it should be physically grounded, namely it should repre- 
sent a faithful mathematical description of the physical 
phenomena that give rise to the noise. 
A class of processes that fulfill both requirements is that 
of the exogenous processes, wherein the noise is regarded 
as the product of two mutually independent components, 
namely 

n(t) = ,(t)g(t) (2) 

The process g(t) is a zero- mean, complex, Gaussian, pos- 
sibly correlated process, whilst s(t) is a real, non-negative 
process, whose average decorrelation time is heretoafter 
assumed to be much longer than the decorrelation time 
of g(t). This model was first introduced and validated 
by Hall [4] with reference to atmospheric noise and sub- 
sequently widely referred to as a suitable one for many 
sources of impulsive noise [3,5]. Notice also that if the 
coherence time of s{t) is much longer than the signalling 
interval, then the noise process (2), as observed at the 
receiver, degenerates into a spherically symmetric pro- 
cess, namely into the product of the Gaussian process 
g(t) times a random constant *, i.e.: 

n(t) = sg(t) (3) 

This approximation results in considerable mathematical 
simplifications, since the process (3) can be shown to be 
amenable to a complete statistical characterisation based 
solely upon knowledge of its marginal pdf and covariance 
function [6]. 

Unfortunately, not all the marginal pdf's are compatible 
with the representations (2) and (3); in fact, notice that, 
by the total probability law, the common marginal pdf 



of the noise quadrature components, / n (z) say, should 
identically satisfy the relationship 

wherein c 2 denotes the power of g{t) as well as of the 
overal! oise process: in the above equation, indeed, we 
ass unit i, without loss of generality, that s possesses unit 
mean square value. Therefore, in order that the process 
n(t) be admissible as a compound- Gaussian one, it is 
necessary and sufficient that a pdf f{s) exist satisfying 
the integral equation (4). 

However, all the most credited models for the noise 
marginal pdf can be shown to be admissible as compound 
Gaussian processes; among them we cite [6] 

- the Generalised Gaussian , namely 

where a > 0 is related to the variance a 2 as a 3 = 
g2 ?l?ly) antl v > 0 k a ^ape parameter. The ad- 
missibility range of the shape parameter of this pdf 
has been shown to be 0 < v < 2. 

- the Generalised Laplace , namely 

/n(l) = (t 1 )" 1/Ijr "-W«N). (6) 

where u > 0 is again a shape parameter, whilst a > 0 
is related to the variance a 2 as a 2 = |^ . Unlike the 
previous pdf, the Generalised Laplace is admissible 
in the whole range of its shape parameter. 

- the Generalised Cauchy , namely 

'•<■> = 2!: 9s$ 2 <°• "•)--". (0 

where a > 0 is still a scale parameter, while u > 0 is 
a shape parameter: such a distribution is admissible 
for any v. 

- the Middleton Class- A . namely 

"■>-?7=F-(-i5y » 

wherein $i are discrete values that s can take on with 
probabilities e, . In Middleton 's Class- A model these 
parameters assume the form 

-i/"* 7 i/u + X A 

fi = e a *• = iTx"' , = 0 ' 1 -- < 9 > 

where u is a positive shape parameter - also referred 
to as impulsive index - and A is the ratio of the power 
of the Gaussian component of the noise to that of 
the impulsive (Poisson) component. 
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Based on the model described above, we can proceed 
to the solution of the M— ary detection problem. By 
applying the usual procedures of the statistical detection 
theory, it can be shown that, insofar as the noise is white, 
the test admits the dot products 



*(«)#?(«)* 



i= l,...,Af 



as sufficient statistics [7,8], which leads to the optimum 
test 



(10) 



then choose Mi 



namely to the conventional detector of figure 1. We 
stress here that the detector (10) is canonically optimum 
irrespective of the pdf /(«) * and hence, of the noise 
marginal pdf - in the class of spherically symmetric im- 
pulsive noise as well as of the fading law f(A) and the 
transmitted equi- energy-signals constellation: this is a 
straightforward, although not trivial, consequence of the 
noise model we adopted and, especially, of the assump- 
tion that the process s(i) is approximately constant over 
the signalling interval. 

Notice however that the above arguments rely on the 
hypothesis that the noise process is white and would no 
longer apply, should n(t) possess an arbitrary correlation 
function. On the other hand, the process (3) is easily 
seen, based upon the scale property of linear systems, 
to be closed under linear transformations, in the sense 
that any linear operation leaves unaffected the marginal 
pdf of n(i). Consequently, one might conceive to whiten 
the noise by linearly filtering the received signal: there- 
fore, the problem of detecting the waveforms pi{t) in the 
presence of correlated disturbance can be reduced to that 
of detecting their filtered version, $(<) say, in uncorre- 
cted noise. However, upon filtering, these waveforms are 
no longer ensured to be equi-energy, whence such a test 
would no longer admit the detected envelopes as suffi- 
cient statistics: if, however, based upon the knowledge 
of the noise correlation, the transmitted waveforms are 
suitably scaled so as to ensure that qi(t) i = 1, . . ., Af 
is a set of equi-energy signals, the test (10) is still canon- 
ically optimum. 

ASYMPTOTIC PERFORMANCE 
The analytical evaluation of the error probability for 
arbitrary signal constellation, subject to uncorrelated 
spherically symmetric impulsive noise, can take great ad- 
vantage of some known results about detection in Gaus- 
sian noise. In fact, conditioned upon s, the noise process 
is a mere Gaussian one: thus, if we denote by P{e\s) the 



error probability under such a disturbance, the uncondi- 
tional performance is then obtained by simply averaging 
s out of P(e\a). 

The performance of the test (10) -for finite Af V subject 
to spherically symmetric uncorrelated noise has been as- 
sessed in [6,7,8] in the general case. Here we focus on the 
assessment of the asymptotic error probability: to this 
end, we first assume that the signal constellation con- 
sists of a set of M equally likely, orthogonal, equi-energy 
signals and then we let M diverge to infinity. 
The conditional performance for finite Af over non- 
fluctuating channel is expressed as [1] 



wherein A tm , is the channel root mean square (rms) 
gain and fR is the signal-to-noise ratio per symbol, 
y R - £ P /a 2 % related to the signal-to-noise ratio per bit 
76 as fit = 7*log2 Af. The error probability subject to 
impulsive, possibly fading channel is obviously obtained 
by averaging (11) with respect to A and s. 
Starting upon relationship (11), the limit performance 
can be obtained by letting M diverge to infinity: 

1 if #^<\n2 
Poo(«M)= { • A ~'' (12) 

0 otherwise 

which confirms that arbitrarily reliable communication 
on non-fluctuating Gaussian channel (i.e., for A = A rm $ 
and s = 1) can be achieved for any signal- to- noise ratio 
above the Shannon limit In 2 (-1.6 dB), provided that the 
signal space has infinite dimensions. 
In order to investigate the impact of impulsive noise and 
possible fluctuations in the signal amplitude, we are to 
average s and A out of (12). To begin with, assume that 
the channel is impulsive and deterministic. Averaging 
(12) with respect to s yields 



Pco(e) = r°u{.-yS)/(.)ii 



(13) 



wherein u(-) denotes the unit step function and Fs(-) 
is the cumulative distribution function of the random 
constant s. Relationship (13) highlights that impulsive 
noise results into smoothing of the Shannon limit curve, 
which no longer exhibits the stepwise transition between 
completely unreliable and fully reliable communication; 
conversely, one might expect that the heavier the tails 
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of the complementary cumulative distribution 1 — Fs(') 

- namely the spikier the noise -, the smoother the tran- 
sition. 

As working examples, we considered two cases of relevant 
practical interest, namely 

• Compound- Gaussian noise with Generalised LapLacian 
marginal distribution, implying 

/(4) = j£)' i '" ltx? i-'" r > < M > 

- Compound- Gaussian noise with Middle ton Class- A 

marginal distribution, implying 

/« = £««(•-*) 05) 

i 

where <$(■) denotes the Dirac's delta function. 
Correspondingly, the limit performance is written as 

*-w =, -r£r (*••£*) (16) 

for the Generalised Laplace, and 

Me) = l-S>«G/£-«) W 

for the Middleton Class- A: in the above equations, 7( ; •) 
is the incomplete Eulerian Gamma function. 
Sample curves are reported in figures 2a and 2c, for 
Laplacian and Middleton Class- A marginal pdf, respec- 
tively (dot-dash curves): in the latter model, we assumed 
A — 0, namely we nullified the Gaussian component in 
favour of the impulsive one. In both cases, decreas- 
ing values of the shape parameters - implying heavier 
and heavier tails in the noise distribution - result into 
smoother and smoother curves; conversely, as the shape 
parameters tend to infinity, namely as both the Gener- 
alised Laplacian and the Middleton Class- A distributions 
approach their Gaussian limit, the performance curves 
tend toward the well-known Shannon limit pertaining to 
Gaussian channels. 

The curve referring to Middleton class-A pdf deserves 
some further comments. First, notice the stepwise be- 
haviour of foo(e) with increasing signal-to-noise ratio 
per bit: this is obviously due to the discrete nature of 
the modulating random constant 5. Somewhat surpris- 
ing might appear the behaviour of P <x ,(e) as y b vanishes, 
since the error probability does not tend to unity. In 
fact, it can be easily shown that, for Middleton class-A 
pdf, we have 

^lim o Poo(c) = 1 - eo = 1 - e~" (18) 



This is a straightforward consequence of the neglect for 
the Gaussian component of the noise, namely of the as- 
sumption A = 0: in fact, this implies a nonzero probabil- 
ity of no noise in the observed signal (see equations (8) 
and (15)). 

In order to investigate the effect of the fading, we re- 
lax the hypothesis of non-fluctuating channel. Thus, a 
general expression for the limit performance subject to 
impulsive noise and fading can be achieved by averaging 
(12) with respect to A and s, which yields: 

^-i-jf^^^/M" (,„ 

The above relationship holds true for arbitrary fading 
and noise marginal distribution: on one hand, it repre- 
sents a generalisation of (13), to which it reduces for the 
special case f(A) = 6(A - Ar mt )\ on the other, it also 
generalises a similar result recently established by Cre- 
pe au [9] with reference to white Gaussian noise. In fact, 
by inverting the order of integration, equation (19) can 
be re-cast in the form: 

Poo (e) = 1 - J™ F A [sA rmi <J^J f(s) ds (20) 

where now Fa() denotes the cumulative distribution 
function of the fading. 

As a general comment, the main effect of the fading is 
seen to be a further smoothing of the limit curve: in other 
words, the deeper the fading, the slower the convergence 
to zero of Poo(c). 

In order to give a quantitative insight into the effect of 
the fading depth, we first have to select a model for the 
fluctuation law f(A): a reasonable and quite general one 
is the so-called Nakagami-m family of distributions, i.e.: 

where m is a shape parameter ruling the fading depth, 
and is precisely the ratio of the fading standard deviation 
to its root mean square value. In other words, the larger 
m, the more constrained the fluctuation: such a family 
subsumes some cases of relevant practical interest, such 
as the Rayleigh channel (m = 1), the one- plus-dominant 
Rayleigh (m = 2) and the non-fluctuating (m = oo). 
As worked example we again consider the two cases of 
spherically symmetric noise with Generalised Laplace 
and Middleton Class-A marginal pdf; for both cases, 
an explicit expression for the asymptotic performance in 
Nakagami-m fading channel can be derived by substi- 
tuting the corresponding expressions for F${-) into (21), 
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which yields 

'-W-^rwrw (22) 

for the Generalised Laplace, and 

, v v - ^ €i ( s?mln2\ 

^w=ErRH m: ^r-) (23) 

for the Middleton Class- A. In figures 2a and 2c the effect 
of the fading depth is represented for Laplacian and Mid- 
dleton Class- A marginal pdf, respectively and for several 
values of the shape parameter i/: the limiting cases of 
non-fluctuating channel (m = oo) and Rayleigh channel 
(m — 1) are reported only. As general comments, it is 
easily seen that, in both cases, the deeper the fading, the 
smoother the transition between unreliable and fully reli- 
able communication. Further, for extremely deep fading, 
as, for example, observed on Rayleigh channels (m = 1), 
the asymptotic performance is primarily ruled by the fad- 
ing law, and is practically unaffected by the particular 
noise model being in force. In order to further investi- 
gate the joint effect of the fading and of the noise spiky- 
ness, we. refer to figures 2b and 2d, where, for the same 
noise distributions, the limiting performance is reported 
for several values of m and the two limiting situations 
of highly spiky (i/— 0.1) and Gaussian (v = oo) noise: 
as expected, for both noise models and for any imple- 
mented value of the shape parameter v, the deeper the 
fading, the smoother the limit curve, which admits the 
two cases m = oo and m = 1 as lower and upper bound, 
respectively. 
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Fig. 2 Asymptotic performance in impulsive noise. 



a) Laplacian pdf for several i/'s: 

b) Laplacian pdf for several m's: 

c) Middleton class- A pdf for several v's: 
b) Middleton class-A pdf for several m*s: 



• Rayleigh fading, 
•i' = 0.1, 
■ Rayleigh fading, 
- */ = 0.1, 



non-fluctuating, 
i/ = oo. 

non fluctuating. 



— • — ■ — V — oo. 
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